CS - 621 Theory Gems November 14 , 2012 Lecture 16 Lecturer : Aleksander Madry Scribes : Chidambaram Annamalai

نویسندگان

  • Aleksander Madry
  • Chidambaram Annamalai
  • Yann Barbotin
چکیده

In the last lecture, we introduced the streaming model. The goal of this model is to help us develop algorithms that are useful in the scenarios where we want to process large amounts of data while having only very limited storage (and processing time) at our disposal. The motivating example here is a network router. This device has an extremely small memory compared to the network traffic that flows through it (in particular, it would be hopeless for it to try to story all that traffic in its memory). However, we still would like it to be able to compute some useful statistics of that traffic (e.g., number of distinct IP destinations requested, the most frequently requested IP address). Of course, at first, one might expect this to be an impossible task, but – as we will see in coming lectures – quite often one can obtain pretty satisfactory solutions. Formally, in the streaming model (cf. Figure 2), we view the data as a long vector – a stream – y = (y1, . . . , yn), where the length n of the stream is known in advance (and very large). The elements yi of this stream belong to a fixed universe U of size m and (without loss of generality) we will assume that U := {1, 2, . . . ,m} and that m ≤ n. Now, the way we can access the data is quite restricted, we are only allowed to have one pass over stream. So, a streaming algorithm during this pass has to accumulate in its (limited) memory, enough information to be able to compute the desired statistic of the data represented by the stream.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lecturer : Aleksander Mądry Scribes : Chidambaram Annamalai and Carsten Moldenhauer

We continue our exploration of streaming algorithms. First, we briefly discuss the sparse approximation problem – a problem of fundamental importance in streaming and signal processing – and its connection to the Count-Min algorithm. Next, we start investigation of the lowerbounds on space complexity of streaming algorithms. In particular, we revisit the question of lowerbounds for the distinct...

متن کامل

Cs-621 Theory Gems

In Lecture 10, we introduced a fundamental object of spectral graph theory: the graph Laplacian, and established some of its basic properties. We then focused on the task of estimating the value of eigenvalues of Laplacians. In particular, we proved the Courant-Fisher theorem that is instrumental in obtaining upper-bounding estimates on eigenvalues. Today, we continue by showing a technique – s...

متن کامل

Cs-621 Theory Gems

That is, we want the hyperplane corresponding to (w, θ) to separate the positive examples from the negative ones. As we already argued previously, wlog we can constrain ourselves to the case when θ = 0 (i.e., the hyperplane passes through the origin) and there are only positive examples (i.e., l = 1, for all j). Last time we presented a simple algorithm for this problem called Perceptron algori...

متن کامل

Principles of Data Analytics

These are abridged lecture notes from the Spring 2017 course offering of “Principles of Data Analytics” that I offer at Iowa State University annually. This graduate level course offers an introduction to a variety of data analysis techniques, particularly those relevant for electrical and computer engineers, from an algorithmic perspective. Topics include techniques for classification, visuali...

متن کامل

CS 2429 - Foundations of Communication Complexity Lecture # 8 : 7 November 2012 Lecturer : Lila Fontes

Today we’ll cover some recent results from the paper Lower bounds on information complexity via zero-communication bounds and applications by Kerenidis, Laplante, Lerays, Roland, and Xiao (FOCS 2012). We’ll also recall results covered in past lectures from the papers The partition bound for classical communication complexity and query complexity by Jain and Klauck (CCC 2010) and How to compress...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012